Random Feature Mapping with Signed Circulant Matrix Projection
نویسندگان
چکیده
Random feature mappings have been successfully used for approximating non-linear kernels to scale up kernel methods. Some work aims at speeding up the feature mappings, but brings increasing variance of the approximation. In this paper, we propose a novel random feature mapping method that uses a signed Circulant Random Matrix (CRM) instead of an unstructured random matrix to project input data. The signed CRM has linear space complexity as the whole signed CRM can be recovered from one column of the CRM, and ensures loglinear time complexity to compute the feature mapping using the Fast Fourier Transform (FFT). Theoretically, we prove that approximating Gaussian kernel using our mapping method is unbiased and does not increase the variance. Experimentally, we demonstrate that our proposed mapping method is time and space efficient while retaining similar accuracies with state-of-the-art random feature mapping methods. Our proposed random feature mapping method can be implemented easily and make kernel methods scalable and practical for large scale training and predicting problems.
منابع مشابه
A Geometry Preserving Kernel over Riemannian Manifolds
Abstract- Kernel trick and projection to tangent spaces are two choices for linearizing the data points lying on Riemannian manifolds. These approaches are used to provide the prerequisites for applying standard machine learning methods on Riemannian manifolds. Classical kernels implicitly project data to high dimensional feature space without considering the intrinsic geometry of data points. ...
متن کاملOn Binary Embedding using Circulant Matrices
Binary embeddings provide efficient and powerful ways to perform operations on large scale data. However binary embedding typically requires long codes in order to preserve the discriminative power of the input space. Thus binary coding methods traditionally suffer from high computation and storage costs in such a scenario. To address this problem, we propose Circulant Binary Embedding (CBE) wh...
متن کاملGene Expression Profile Classification in Random Feature Space
In this study, gene expression profile classification is done via sparse representation in the random feature Space, which is obtained by either random projection or nonlinear random mapping used in Extreme learning machine (ELM). The numerical experiment shows that sparse representation has slightly better performance than ELM.
متن کاملSigned Groups, Sequences, and the Asymptotic Existence of Hadamard Matrices
We use the newly developed theory of signed groups and some known sequences with zero autocorrelation to derive new results on the asymptotic existence of Hadamard matrices. New values of t are obtained such that, for any odd number p, there exists an Hadamard matrix of order 2tp. These include: t = 2N, where N is the number of nonzero digits in the binary expansion of p, and t = 4[-~ log2((p1)...
متن کاملJohnson-Lindenstrauss lemma for circulant matrices
The original proof of Johnson and Lindenstrauss [11] uses (up to a scaling factor) an orthogonal projection onto a random k-dimensional subspace of Rd. We refer also to [7] for a beautiful and selfcontained proof. Later on, this lemma found many applications, especially in design of algorithms, where it sometimes allows to reduce the dimension of the underlying problem essentially and break the...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015